Genes for prognosis of cancer

ABSTRACT

To provide a novel method for determining the risk of lymph node metastasis of breast cancer uses as an index the difference in the expression levels of marker genes between metastatic breast cancer tissue (or cell) and non-metastatic breast cancer tissue (or cell). The method involves ( 1 ) measuring the expression level of a gene having a base sequence in human metastatic breast cancer tissue (or cell), ( 2 ) measuring the expression level of the same gene in human non-metastatic breast cancer tissue (or cell), and ( 3 ) comparing the expression level of ( 1 ) with the expression level of ( 2 ) and determine the risk of lymph node metastasis of breast cancer based on the difference between the expression levels.

CROSS-REFERENCE TO RELATED APPLICATION

This is a continuation of Application No. PCT/JP2007/051800, filed on Feb. 2, 2007.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a novel method for determining the risk of lymph node metastasis of breast cancer. More specifically, the present invention relates to a method for determining the risk of lymph node metastasis of breast cancer that is based on comparison of the expression levels of marker genes having specific base sequences between metastatic breast cancer cells and non-metastatic breast cancer cells.

2. Description of the Related Art

In Japan, the number of breast cancer patients is growing rapidly. The cancer is the most prevalent of all cancers in women. Estrogen, a female hormone, is considered a risk factor of breast cancer: women who have been exposed to estrogen for a longer period of time due to early menarche, late menopause, late age at first birth or nulliparity are more likely to develop breast cancer. Western-style high-fat diet and obesity are also associated with this type of cancer since estrogen is primarily produced in fat tissue in postmenopausal women. The changing lifestyles of Japanese women, such as their active participation in society, also contribute to the increase in the incidence of breast cancer.

Breast cancer is generally divided into three classes: non-invasive carcinomas, invasive carcinomas and Paget's disease of the breast. Most of the incidences of breast cancer that form lumps are invasive. There are common and special types of invasive breast cancers. The common types include scirrhoma, papillotubular carcinoma and solid-tubular carcinoma. The special types include mucinous carcinoma.

Because no blood test is available to specifically detect breast cancer, the detection of early breast cancers relies primarily on palpation and X-ray imaging. However, these techniques, even when used in combination, fail to detect as much as 20% of the cancer. In addition, diagnosis by X-ray imaging often requires specialists. The cytodiagnosis conducted before and during the surgical procedures can only be done by a pathologist and is often difficult due to the shortage of experienced pathologists and varying standards of the diagnosis. Thus, no subjective and simple technique for the detection/diagnosis of early breast cancers has ever existed to bridge the gap between detection and diagnosis of the disease. The PET analysis, a new diagnostic technique that can detect tumor tissue 1 mm or less in diameter, requires large-scale facilities and is therefore not readily used for the detection of breast cancer.

Recent studies have shown that cancers are caused by anomalies in genes. For example, techniques have been proposed that detect cancer cells by making use of the fact that certain genes are expressed at different levels in a cancer tissue and a normal tissue (Japanese Patent Application Laid-Open (JP-A) Nos. 2003-284594 and 2003-284596).

BRIEF SUMMARY OF THE INVENTION

Once lymph node-metastatic breast cancer has been removed by surgery, prognosis is predicted based on indices such as tumor size, nuclear pleomorphism of the removed cancer cells and of hormone receptor levels. Where necessary, adjuvant therapy is given to prevent metastasis to lymph nodes or the recurrence of cancer. The prediction of prognosis based on these presently available indices is not accurate enough, however, and more accurate indices for the prognosis of breast cancer patients are therefore needed to reduce the risk of recurrence and improve patients' quality of life by proper medication.

In view of the above-described problems, the present inventors have conducted extensive studies and observed that certain marker genes are expressed at different levels in metastatic breast cancer cells or tissues and in non-metastatic breast cancer cells or tissues. The present inventors found that these marker genes could be used to determine the risk of lymph node metastasis of breast cancer and ultimately devised the present invention. Accordingly, the present invention provides the following measures to address the above-described problems.

(1) A method for determining the risk of lymph node metastasis of breast cancer, including using as an index the difference in the expression level of a marker gene between a metastatic breast cancer tissue (or cell) and a non-metastatic breast cancer tissue (or cell).

(2) The method according to (1) above, wherein the expression level of the marker gene is determined by the amount of mRNA of the gene.

(3) The method according to (1) or (2) above, wherein the marker gene is at least one selected from the group consisting of genes having base sequences of GenBank accession Nos. NM000903, NM006804, NM033547, CR611676, NM177967, NM152558, NM178167, NM003752, AK131568, CR592336, NM178507, NM002862, NM006913, NM005794, NM014164, NM000853 and a base sequence extending from 178882962bp to 178883181bp of chromosome 3, and homologs thereof.

(4) The method according to any one of (1) to (3) above, wherein the expression level of the marker gene in the metastatic breast cancer tissue (or cell) is equal to or higher than twice the expression level in the non-metastatic breast cancer tissue (cells), or equal to or lower than one-half the expression level in the non-metastatic breast cancer tissue.

The method of the present invention enables quick and simple determination of the risk of lymph node metastasis of breast cancer at the genetic level, thus providing an effective way to prevent metastasis of breast cancer.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

FIG. 1 is a diagram showing a comparison of the expression levels of a transcript (transcript 1) of a marker gene according to high-coverage gene expression profiling (HiCEP).

FIG. 2 is a diagram showing a comparison of the expression levels of a transcript (transcript 2) of another marker gene according to HiCEP.

FIG. 3 is a diagram showing a comparison of the expression levels of a transcript (transcript 3) of another marker gene according to HiCEP.

FIG. 4 is a diagram showing a comparison of the expression levels of a transcript (transcript 4) of another marker gene according to HiCEP.

FIG. 5 is a diagram showing a comparison of the expression levels of a transcript (transcript 5) of another marker gene according to HiCEP.

FIG. 6 is a diagram showing a comparison of the expression levels of a transcript (transcript 6) of another marker gene according to HiCEP.

FIG. 7 is a diagram showing a comparison of the expression levels of a transcript (transcript 7) of another marker gene according to HiCEP.

FIG. 8 is a diagram showing a comparison of the expression levels of a transcript (transcript 8) of another marker gene according to HiCEP.

FIG. 9 is a diagram showing a comparison of the expression levels of a transcript (transcript 9) of another marker gene according to HiCEP.

FIG. 10 is a diagram showing a comparison of the expression levels of a transcript (transcript 10) of another marker gene according to HiCEP.

FIG. 11 is a diagram showing a comparison of the expression levels of a transcript (transcript 11) of another marker gene according to HiCEP.

FIG. 12 is a diagram showing a comparison of the expression levels of a transcript (transcript 12) of another marker gene according to HiCEP.

FIG. 13 is a diagram showing a comparison of the expression levels of a transcript (transcript 13) of another marker gene according to HiCEP.

FIG. 14 is a diagram showing a comparison of the expression levels of a transcript (transcript 14) of another marker gene according to HiCEP.

FIG. 15 is a diagram showing a comparison of the expression levels of a transcript (transcript 15) of another marker gene according to HiCEP.

FIG. 16 is a diagram showing a comparison of the expression levels of a transcript (transcript 16) of another marker gene according to HiCEP.

FIG. 17 is a diagram showing a comparison of the expression levels of a transcript (transcript 17) of another marker gene according to HiCEP.

DETAILED DESCRIPTION OF THE INVENTION

The present invention concerns a method for determining the risk of lymph node metastasis of breast cancer that uses as an index of the risk of metastasis the difference in the expression levels of specific marker genes between metastatic breast cancer cells or tissues and non-metastatic breast cancer cells or tissues. As used herein, the term “marker gene” refers to a gene that enables the determination of the risk of metastasis of breast cancer cells by comparing its expression levels between metastatic breast cancer cells or tissues and non-metastatic breast cancer cells or tissues.

The present invention also concerns a method for determining the risk of lymph node metastasis of breast cancer in which the expression levels of the marker genes are determined by the amounts of mRNA of the marker genes. More specifically, the present invention concerns a method for determining the risk of lymph node metastasis of breast cancer that involves extracting total RNA from cells obtained from metastatic and non-metastatic breast cancer tissues, and comparing the amounts of mRNA transcripts transcribed from the marker genes. Different techniques for gene expression analysis can be used to determine the amounts of mRNA of genes of interest, including comprehensive transcriptome analysis (high-coverage gene expression profiling, HiCEP), DNA microarrays, quantitative RT-PCR and northern hybridization. Gene expression analysis techniques that can determine the amounts of mRNA without extracting total RNA from cells, such as in situ hybridization, may also be used in the present invention. The above-described techniques may be used in combination to improve the accuracy of detection. The translated products of the genes of the present invention may also be quantified by, for example, determining the amounts of proteins coded by the genes. Proteins of interest can be quantified by using techniques such as immunological assays using antibodies specific for the proteins (such as ELISA, western blotting and RIA), two-dimensional electrophoresis and high-performance liquid chromatography (HPLC). Antibodies specific for the proteins coded by the genes of the present invention can be prepared by common techniques using the proteins coded by the genes as antigens.

HiCEP is one of the transcriptome analysis techniques and is characterized by its comprehensiveness and high sensitivity. The following is a brief overview of the technique (See Nucleic Acids Res., 2003, Vol.31, No.16 e94 for more details): Using common techniques, total RNA is extracted and purified from tissue or cell samples. Double-stranded cDNA is synthesized from the total RNA using 5′-biotinylated oligo(dT) primers. The cDNA is then digested with a restriction enzyme MspI. Poly(A)-containing fragments are collected by avidin beads and 3′-adaptor is ligated to the MspI-digested ends of the collected fragments. The fragments are then digested with a restriction enzyme MseI and 3′-adapter is ligated to the MseI-digested ends. PCR primers are constructed by adding all possible combinations of two selected bases to the same adapter sequences as those ligated to 5′ and 3′ ends (16 5′-end primers and 16 3′-end primers with 5′-end primers fluorescent-labeled). Using these primers, 256 different quantitative PCRs are performed. The PCR products obtained for each primer pair are loaded on a fragment analyzer to obtain 256 electrophoresis profiles (gene expression profiles), each containing multiple fluorescence peaks, for a sample. The expression levels of transcripts can then be compared by comparing the fluorescence peaks among different samples.

The marker gene for use in the present invention may be any gene that is expressed at significantly different levels between metastatic breast cancer cells or tissues and non-metastatic breast cancer cells or tissues. For example, the marker gene may be at least one selected from the group consisting of genes having base sequences of GenBank accession Nos. NM000903, NM006804, NM033547, CR611676, NM177967, NM152558, NM178167, NM003752, AK131568, CR592336, NM178507, NM002862, NM006913, NM005794, NM014164 and NM000853 and a base sequence extending from 178882962bp to 178883181bp of chromosome 3, and homologs thereof.

Data stored in the GenBank database may contain the same gene registered by different researchers, at different times, in different fields and under different names or gene polymorphisms or splicing variants of the same gene registered as novel genes. Thus, different base sequences that can be considered to be originated from a single gene may be registered with different accession numbers. These base sequences are collectively referred to as “homologs”. The term is used in the same context throughout this specification.

One characteristic feature of the method of the present invention for determining the risk of lymph node metastasis of breast cancer is the use of marker genes that are expressed at significantly different expression levels between metastatic breast cancer cells or tissues and non-metastatic breast cancer cells or tissues. The term “expression level” may refer to either the amount of mRNA transcribed from a marker gene or the amount of a protein translated from mRNA. With regard to the difference in the expression level of a marker gene between metastatic breast cancer cells or tissues and non-metastatic breast cancer cells or tissues, the ratio of the expression level of a marker gene in non-metastatic breast cancer cells or tissues to the expression level of the same gene in metastatic breast cancer cells or tissues is preferably in the range of 1.5 or higher or ⅔ or lower, and more preferably in the range of 2 or higher or ½ or lower. A marker gene does not serve as an accurate index of the risk of lymph node metastasis of breast cancer and is therefore not desirable when the ratio of its expression level in non-metastatic breast cancer cells or tissues to that in metastatic breast cancer cells or tissues is outside the above-described range.

While the method of the present invention can be applied to any type of breast cancer, including breast ductal carcinomas (such as papillotubular carcinoma, solid-tubular carcinoma and scirrhoma), lobular carcinomas, special-type carcinomas (such as mucinous carcinoma, medullary carcinoma and tubular carcinoma) and Paget's disease of the breast, it is preferably applied to scirrhoma, lobular carcinomas or solid-tubular carcinoma.

EXAMPLE

The present invention will now be described with reference to Example, which is not intended to limit the scope of the invention in any way.

In this Example, the expression levels (RNA transcription levels) of different genes are compared between human metastatic breast cancer tissue and non-metastatic tissue using one of the gene expression analysis techniques known as high-coverage gene expression profiling technique (HiCEP), a known comprehensive, highly sensitive technique for transcriptome analysis (Nucleic Acids Res., 2003, vol.31(16), e94).

The breast cancer tissues used in Example were shown in Table 1. The tissues were collected from five stage II breast cancer patients (commercial products, all Caucasian, primary tumor, lymph node metastasis (2), no lymph node metastasis (3), all had stage II cancer based on TNM classification).

TABLE 1 Samples Metastasis Tumor size Age #A + 12 cm 57 #B + 2 cm × 1.5 cm × 1.5 cm 69 #C − 2.5 cm 50 #D − 4 cm × 2 cm × 1.7 cm 61 #E − 6 cm × 5.5 cm × 4.5 cm 68

Total RNA was extracted from the samples by a common kit technique using RNeasy kit (Qiagen). 0.1 μg of total RNA from each sample was used as template and reverse-transcribed using Super Script First Strand Synthesis System for RT-PCR (Invitrogen). The reverse transcript was incubated with DNA polymerase I (80 units), RNAase H (4 units, Invitrogen) and E. coli DNA ligase (40 units, Invitrogen) at 16° C. for 2 hours. The resulting double-stranded DNA was incubated with restriction enzymes Mse I (40 units, New England Biolabs) and Msp I (50 units, TaKaRa Bio) at 37° C. for 4 hours. Adaptor sequences were ligated to the ends of the resulting DNA fragments. Selective PCRs were performed using the adaptor-ligated DNA fragments as templates. The amplified products were analyzed by capillary electrophoresis. The waveform data were used to determine gene expression levels, compare the gene expression levels among the samples, and classify the genes into different expression patterns to obtain data for clustering (expression variation peaks).

The results of the analysis shown in Table 2 and FIGS. 1 through 17 demonstrate that the difference in the fluorescence peak intensity between samples obtained from patients with lymph node metastasis and samples obtained from patients with no metastasis was significant for each of the 17 marker gene transcripts. In this analysis, each sample was assayed in two replicates and the resulting fluorescence peaks were overlapped. Arrows indicate the peaks for the marker gene transcripts.

Specifically, Transcripts 1 through 11 (as numbered in Table 1) each show a significant fluorescence peak in each of the metastasis samples but show no expression peak or, if any, a peak intensity that is half or less of the peak intensity of the metastasis samples in each of the non-metastasis samples. Conversely, Transcripts 12 through 17 each show a significant fluorescence peak in each of the non-metastasis samples but show no expression peak or, if any, a peak intensity that is half or less of the peak intensity of the non-metastasis samples in each of the metastasis samples. These observations demonstrate that each of the 17 genes can serve as an index of the risk of breast cancer metastasis that allows the determination of the risk of metastasis based on their expression levels.

TABLE 2 Transcripts as markers for breast cancer metastasis GenBank Transcripts Sequences Accession No. Annotation Characteristics #1 Transcript sequence NM_000903 NAD(P)H menadione Experssion containing a base oxidoreductase 1 enhanced in sequence from 68302490 bp metastatic to 68317861 bp of (−) strand breast cancer of chromosome 16 #2 Transcript sequence N/A N/A containing a base sequence from 178882962 bp to 178883181 bp of (+) strand #3 Transcript sequence NM_006804 steroidogenic acute containing a base regulatory protein sequence from 35050592 bp related to 35050643 bp of (+) strand of chromosome 17 #4 Transcript sequence NM_033547 Homo sapiens containing a base hypothetical gene sequence from77267542 bp MGC16733 similar to to 77272569 bp of (−) strand CG12113 (MGC16733), of chromosome 11 mRNA. #5 Transcript sequence CR611676 Similar to Px19-like containing a base protein (25 kDa protein sequence of relevant evolutionary from176665540 bp to and lymphoid interest) 176666255 bp of (+) strand (PRELI) (CGI-106) of chromosome 5 (SBBI12) #6 Transcript sequence NM_177967 Phosphoglycerate containing a base dehydrogenase like 1 sequence from 98835662 bp to 98835862 bp of (+) strand of chromosome 13 #7 Transcript sequence NM_152558 IQ motif containing E containing a base (IQCE) sequence from 2426581 bp to 2426860 bp of (+) strand of chromosome 7 #8 Transcript sequence NM_178167 Zinc finger protein containing a base 598 sequence from 1987788 bp to 1987865 bp of (−) strand of chromosome 16 #9 Transcript sequence NM_003752 Eukaryotic containing a base translation initiation sequence from 228320033 bp factor 3, subunit 8, to 28320077 bp of (−) 110 kDa strand of chromosome 16 #10 Transcript sequence AK131568 V-erb-b2 erythroblastic containing a base leukemia viral oncogene sequence from 35135544 bp homolog 2, to 35135831 bp of (+) neuro/glioblastoma derived strand of chromosome 17 oncogene homolog (avian) #11 Transcript sequence CR592336 V-erb-b2 erythroblastic containing a base leukemia viral oncogene sequence from 35126382 bp homolog 2, to 35127393 bp of (+) neuro/glioblastoma strand of chromosome 17 derived oncogene homolog (avian) #12 Transcript sequence NM_178507 NS5ATP13TP2 Expression containing a base protein decreased in sequence from119605680 bp metastatic to 119605847 bp of (+) breast cancer strand of chromosome 11 #13 Transcript sequence NM_002862 Phosphorylase, containing a base glycogen; brain sequence from 25226174 bp to 25226624 bp of (+) strand of chromosome 20 #14 Transcript sequence NM_006913 Ring finger protein 5 containing a base sequence from 32256007 bp to 32256297 bp of (+) strand of chromosome 6 #15 Transcript sequence NM_005794 Dehydrogenase/reductase containing a base (SDR family) sequence from 23183541 bp member 2 to 23184510 bp of (−) strand of chromosome 14 #16 Transcript sequence NM_014164 FXYD domain containing a base containing ion sequence from 40352503 bp transport regulator 5 to 40352595 bp of (+) strand of chromosome 19 #17 Transcript sequence NM_000853 Glutathione S- containing a base transferase theta 1 sequence from 22700873 bp to 22700983 bp of (+) strand of chromosome 22

Genes according to the present invention enable the highly sensitive and subjective, yet simple and quick determination of lymph node metastasis of breast cancer, a task that has never been achieved by any of the conventional techniques. The genes of the present invention therefore serve as markers for the prognosis of breast cancer. 

1. A method for determining the risk of lymph node metastasis of breast cancer, comprising: using as an index the difference in the expression level of a marker gene between a metastatic breast cancer tissue (or cell) and a non-metastatic breast cancer tissue (or cell).
 2. The method according to claim 1, wherein the expression level of the marker gene is determined by the amount of mRNA of the gene.
 3. The method according to claim 1, wherein the marker gene is at least one selected from the group consisting of genes having base sequences of GenBank accession Nos. NM000903, NM006804, NM033547, CR611676, NM177967, NM152558, NM178167, NM003752, AK131568, CR592336, NM178507, NM002862, NM006913, NM005794, NM014164, NM000853 and a base sequence extending from 178882962bp to 178883181bp of chromosome 3, and homologs thereof.
 4. The method according to claim 1, wherein the expression level of the marker gene in the metastatic breast cancer tissue (or cell) is equal to or higher than twice the expression level in the non-metastatic breast cancer tissue (cells), or equal to or lower than one-half the expression level in the non-metastatic breast cancer tissue. 